# High-fidelity audio
Voxpolska V1 Merged 16bit
Apache-2.0
VoxPolska is an advanced model focused on Polish text-to-speech conversion, capable of generating natural, fluent, and expressive Polish speech.
Speech Synthesis
Transformers Other

V
salihfurkaan
116
1
Inspiremusic Base
Apache-2.0
InspireMusic is a unified toolkit focused on music generation, song generation, and audio generation, featuring high audio quality and long-form music generation capabilities.
Audio Generation
Safetensors English
I
FunAudioLLM
60
10
MP SENet DNS
MIT
An audio denoising and voice enhancement model based on Pytorch, which effectively removes audio noise and improves voice clarity
Audio Enhancement
Safetensors
M
JacobLinCool
723
1
Musicgen Stereo Melody Large
MusicGen is a text-to-music generation model that supports stereo and melody guidance, capable of producing high-quality music samples based on text descriptions or audio prompts.
Audio Generation
Transformers

M
facebook
61
47
Bark Small
Bark is a Transformer-based text-to-audio model created by Suno, capable of generating highly realistic multilingual speech, music, background noise, and simple sound effects.
Speech Synthesis
Transformers Supports Multiple Languages

B
ylacombe
1,947
2
Tts Transformer Zh Cv7 Css10
A Transformer-based text-to-speech model built on fairseq S^2, supporting Simplified Chinese with a single female voice, trained on Common Voice v7 and CSS10 datasets.
Speech Synthesis Chinese
T
facebook
15
85
Featured Recommended AI Models